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Abstract. The well-known O (n 1_1 / d ) behaviour of the optimal tour length 
for TSP in d-dimensional Cartesian space causes breaches of the triangle in- 
equality. Other practical inadequacies of this model are discussed, including 
its use as basis for approximation of the TSP optimal tour length or bounds 
derivations, which I attempt to remedy. 



1. Introduction 

Studies of the behaviour of the optimal tour value in respect of the TSP has 
been in place since the fifties. Considerate work has been done with the O (n 1_1 / d ) 
model of behaviour based on results of Beardwood, Halton and Hammersley |Q. 
The approach taken here is one of simple real analysis and some of the intermediate 
results correspond with those of Snyder and Steele || and Bern and Eppstein Q . 

As pointed out by Bern and Eppstein Q, an a priori bound on a geometric 
graph is a bound that depends on the assumption that all vertices lie within a 
given container. Rather than bounds, I consider the behaviour of the optimal tour 
length of a geometric graph in a 2D rectangular region, which is transformed to a 
square and then simplified (extended) to d-dimensional cube. This is compared to 
the behaviour of the MST length in the same container space. 

The problem of studying the behaviour or approximation of the optimal tour 
length of TSP instances (||0||) can be illustrated by an example: Given two TSP 
instances with substantially different number of vertices have the same optimal tour 
length. 

It is shown that the general behaviour of ||0|| does not depend on the number 
of vertices. I also give experimental evidence in support of this using a small set of 
TSPLIB H instances, and conclude that the commonly cited O behaviour 
is inadequate in both description of behaviour (nor useful bounds) of ||0|| . A simple 
alternative is suggested. 

The present is an extended version of a mention in ||. 

2. Terms and Definitions 

The Minimum TSP in Cartesian space is a special case of the Minimum Weight 
Hamiltonian Cycle problem. Both are specified as follows: 
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P# 1 (Minimum Hamiltonian Cycle). Given G — (V,E) with a cost function 
d : E — > R, find solution O — (Vo, Eo) C G such that 

(a) O is a Hamiltonian cycle; 

(b) ||0|| = Yle-E ^( e ) * s minimum; 

P# 2 (Minimum TSP in Cartesian Space). P#|J and 

(a) Vv : V ■ coords (v) € R d and 

(b) V(it, v) : E ■ d(u,v) — \coords (u) — coords (v) \ 
where coords : V — > R d are Cartesian. 

I (unnecessarilly) split the problem specification this way for my personal con- 
venience in what follows (separating construction and geometric aspects of the 
problems), and for the following observations (from P=#]l]): 

(a) \V\ = \V \ = \E \; 

(b) Solution bounds: 

(1) nwo < \\0\\ < nwi 
where n = \Eo\, 

Wo = min{rf(e)|e G E} A w± = max{c?(e)|e G E} 
The notation used is fairly standard. I have attempted to maintain consistent 
naming, e.g. Oabci is the value (approximate) of the exact ||0||, and the subscript 
i is the number of 'alternative' formulation of Oabc, e.g. Obi is an alternative to 
Ob 2 . 

3. Two Similar Formulations of d-Dimensional ||0|| Behaviour 

Consider a rectangle (being a container space of the TSP graph and respective 
tour) and that we have a uniform grid tight within (and on) that rectangle with 
size a by b points, assuming without loss that a is even. The optimal tour value 
and the diagonal of the rectangle are: 

(2) ||0|| = a ■ b ■ wq — n.wo 

(3) wi = W(a-l) 2 + (&-l) 2 
Dividing: 

(4) noil 



wi ^J(a~iy + (b-l) 2 
If a = b than we have a square, and ||0||/u>i = nj {{y/n — 1) V2)- To carry this 
to Cartesian d-dimensions, (Q) and (||) are changed to 0% — wq IliLi a i = n ' w 
and Wi = wq I J2i=i ~ I) 2 ) • Assuming Vi, j : 1 . . . d • Oj — aj then 

Ox _ n 

wi (n x / d - 1)^ 

Note that 0\ differs from the commonly cited 0(r\}-~ 1 l d ) optimal tour length 

1/2 

behaviour, which can be formulated by taking w\ ~ wo (J2ti d a « 2 ) (under the 
assumption that a is large), then: 

O2 _ n l - l ' d 
wi Vd 
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It should be noted that in the unit d-dimensional cube wi = \fd. 

For a metric (non-topological) graph ||0|| > 2w\ must hold ||. Both 0\ and 
O2 fail to preserve the triangle inequality when d = 1, due to the "spatial" nature 
of the derivations (no container) . It is also evident that 0\ preserves the triangle 
inequality for any n > 2, whereas O2 fails in doing so when n 1 ~ 1 / d < 2\fd. (There 
is nothing to prevent such instances from occuring.) The ratio of the two estimates 
as d — > 00 is: 

(5) eM = 2a = i__L^ 

The divergent behaviour of 0\ and O2 is fixed by imposing the condition n > 
2 d + k, k > in order to have a valid d-dimensional cube, and 1/2 < e (n, d) < 1. 
Either 0\ or O2 will fail in the description (if they are assumed to be descriptive) 
of the optimal tour length behaviour (see Fig.|l|). 

The MST length behaviour (||T||) is analogous @ to that of the TSP in a grid: 

||T|| = (n - 1) • w and T x /wi = (n - 1) / ((n 1 ^ - l)Vd) . \\0\\ can be expressed 
in terms of ||T|| (treated as known value computed exactly by Prim's algorithm, 
say): 

Oh = lirii H 



Ot 2 = \\T\\ 

As a matter of fact: 



n 1 ' 4 -l)Vd 

Wi 



nWy/d 



(6) ||O||>||T||+^ 



n 



(7) (2||T||>)||0||>||r|| 

71—1 

It is the case that for any metric graph ||T|| accounts for at least 50% of ||0|| and 
can be considered a major term in any attempted approximation. It is obvious that 

the second term in Ot\ may well lead to wq > w\/ (^(n 1 l d — l) \fdj , as n — > 00, and 

ultimately Ot\ — > ||T|| (so will Oti), which cannot be. This suggests that ||0|| does 
not depend on the number of vertices. Moreover, the e (n, d) term also remains 
in the relative difference in estimates for the remaining 50% of ||0|j, as seen from 
Oti and Ot 2 - In other words, if |jO|j were to depend on the number of vertices the 
following must be true: 

e{n ' d)= Oh~W\\ =o^||T||^0 

Obviously this is absurd, unless Ot\ — Ot2 t\0\ =02, which is not the case. 
Thus, it is difficult to contemplate using 0\ or O2 (see Fig. 2) as basis for behaviour 
description of ||0||. The OU's are slightly better (see Fig. 3), exhibiting an almost 
"bounding" behaviour, for which we already have (^) and (0) and the HK bound 
(see Fig.||). 

From the above one can conclude that n can do more harm than good in the 
analysis of[|0||. Setting n — 2 d (see above), yet another approximation (see FigJJ 
and Table.[j]) to ||0|| is obtained: 
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x 


mill 6 


rcicix 6 


nictx 6 ■ — min 6 


€-rms 


^1 





1668.14 


1668.14 


308 96 




-39.69 


1632 05 


1671.74 


299.91 


Oh 


—49.26 


0.76 


50.02 


14.16 


ot 2 


-49.28 


0.75 


50.02 


15.00 


Ota 


-14.64 


35.27 


49.91 


7.75 


OtC 2 


-32.32 


17.61 


49.93 


11.14 


HK* 


-9.73 


0.55 


10.28 


2.18 


\\T\\+w 


-49.98 





49.98 


16.25 




-49.98 





49.98 


15.48 



Table 1. Relative error ranges for the various formulas, e = 100- 
(X/\\0\\ - 1) %. (The largest HK* value +0.55 is possibly due to 
a misprint in ||]) 



3.1. Test Data. The test data consist of 85 instances in total of sizes 4 < n < 
2560, I have used 17 instances of my own and 68 instances from TSPLIB ^ of 
type EUC.2D, CEIL and ATT. The three norms were treated as EUC.2D without 
rounding. ATT are treated in the same way, as tour values were multiplied by y/l0 
to bring them in line with unrounded EUC_2D. The HK-bound values are taken 
from There were 5 TSPLIB with published shortest tour, for which I have been 
able to find shorter tours. These are given in Table.||. The HK bound is greater 
than the actual tour length for tsp225 and atH8 by 0.55% and 0.01% (see Table.||), 
respectively, possibly due to a typographical error (or the use of initial rounding 
in computing the HK relaxation). I do not consider accuracy (see Table]!) in this 
analysis of major importance, as for the 50 instances for which the optimal tour 
itself is not published, the data can be fixed (by point displacement) so the reported 
values are accurate (This is not the case with HK bound). 



Instances 




Size 


O 


\\o\\ 


HK 


17 


4 < 


n < 2560 


known 


exact 




18 


48 < 


n < 2392 


published 


exact 


reported 


50 


99 < 


n < 2319 


reported 


reported 


reported 



Table 2. TSP instances 



4. Conclusions 

Inadequacies of O (n 1_1 / d ) and similar formulations are demonstrated. A prob- 
abilistic or statistical 0] argument may show otherwise. 



REMARKS ON D-DIMENSIONAL TSP OPTIMAL TOUR LENGTH BEHAVIOUR 



5 



Instances 


1 1 l^G'pOT'i G.cL 1 1 


HK 


1 1 pupils fi/&d 1 1 


1 1 sJxoi^i&T 1 1 1 


eilbl 


426 


422.4 


429.983307 


429.117879 


st70 


675 


670.9 


678.597412 


677.194473 


eillOl 


629 


627.3 


642.309509 


640.42177 


chl3Q 


6110 


6074.6 


6110.859375 


6110.722217 


cM50 


6528 


6486.6 


6532.282227 


6530.906184 


atU8 


10628 


10602.1 


10601.1282 




tsp225 


3919 


3880.3 


3858.999064 





Table 3. Shorter tours. Values are as reported in || and com- 
puted from published tours in Q 



The optimal tour length does not depend on the number of vertices - consider 
again a grid example in a square: Let's suppose that we have found an optimal tour 
S = (Vs,Es) from G = {V,E) (a comb-like shape, say) and insert (in G) exactly 
one zillion equidistant points along each arc in Es C E of the optimal tour S to 
get Gz- If Sz is an optimal tour of Gz then ||5|| = \\Sz\\- Such point insertion is 
not inconsequential - the number of optimal solutions may be reduced, i.e. Gz has 
a lesser number of optimal solutions than G. 

I make no claims as to quality (if used as estimates) of the above given formulae, 
as I would not use any of them (see Table.|l|) as an estimate of ||0||. 0\ is an 
upper (easily proven so) bound, for any n > 2, however running the Nearest town 
algorithm ra to get us an upper bound within a factor of two is simpler. A simple 
dcrandomisation of the same algorithm yields an approximate solution within a 
factor of y/2 (provable under a geometric argument) for P#j2|in the plane. 
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Figure 1. Plots of e (2 d + k, d), for k = log 2 d,k= rf log2 d , k = d d , 
and k = d d+1 , for d = 1 . . . 100. 
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Figure 2 . Point histograms of relative error of 0\ and O2 - 85 instances 
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Figure 3. Point histograms of relative error of Ot\, Ot2 and HK 
bound (-line used for clarity) 
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FIGURE 4. Point histograms of relative error of Otc\ and Otc2 



ANDREI G. YANEFF 



- eps(IITOII+wO) • eps(HTOII*n/(n-l)) eps(HK) 





a 

« ** * ■ 






9 o 




£ 1 i >• i . 1 

.»•*.".-.* 

« ft ■ 

» a 






8 






a 







Figure 5 . Point histograms of the lower bounds 1 1 T 1 1 +wo , \\T\ 
and HK bound (-line used for clarity) 



